The listing of claims will replace all prior versions, and listings, of claims in the 
application: 

Listing of Claims: 

Please amend claims 1, 6, 7, 17, 21, 22, 50, and 56 as indicated below. 
Please cancel claims 9 and 24. 



1 . (cxurently amended) A method of identifying molecules for production, wherein 
the molecules are represented by concatenated strings, said method comprising: 

i) encoding two or more Fetate4 biological molecules into a data structure of initial 
character strings to provide a collection of two or more different initial character strings wherein 
each of said biological molecules comprises at least about 10 subunits; 

ii) selecting at least two substrings firom said initial character strings; 

iii) concatenating said substrings to fonn one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adding the product strings to a data structure to populate a data structure of product 
strings; mA 

v) detennining sequence identities of the product strings relative to at least one initial 
character string; and 

vi) selecting one or more product biological molecules for production, wherein the one or 
more product biological molecules correspond to one or more of the product strings having 
greater than 30% sequence identity with the at least one initial character string. 

2. (previously presented) The method of claim I , wherein said encoding comprises 
encoding two or more nucleic acid sequences into said character strings. 

3. (previously presented) The method of claim 2. wherein said two or more nucleic 
acid sequences comprise a nucleic acid sequence encoding a naturally occurring protein. 

4. (previously presented) The method of claim 1 , wherein said encoding comprises 
encoding two or more amino acid sequences into said character strings. 

5. (previously presented) The method of claim 4, wherein said two or more amino 
acid sequences comprise an amino acid sequence encoding a naturally occurring protein. 
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6. (currently amended) The method of claim 1 , wherein said initial character 
strings biological mol e cul e s have at least 30% sequence identity with each other. 

7. (currently amended) The method of claim 1 , wherein said selecting in (ii) 
comprises selecting substrings at least one substring from an initial character string such that the 
ends of said substrings substring occur in string regions of about 3 to about 20 characters in the 
initial character string that have higher sequence identity with the corresponding region of 
another of said initial character strings than the overall sequence identity between the saHw two 
initial character strings. 

8. (previously presented) The method of claim 1 , wherein said selecting in (ii) 
comprises selecting substrings such that the ends of said substrings occur in predefined motifs of 
about 4 to about 8 characters. 

9. (canceled) 

10. (previously presented) The method of claim 1 , wherein said selecting in (ii) 
comprises aligning two or more of said initial character strings to maximize pairwise identity 

^ between two or more substrings of the initial character strings, and selecting a character that is a 
member of an aligned pair for the end of one of the two or more substrings. 

11. (canceled) 

12. (previously presented) The method of claim 1, wherein said method further 
comprises randomly altering one or more characters of said initial or product character strings. 

13. (original) The method of claim 12, wherein said method further comprises randomly 
selecting and altering one or more occurrences of a particular preselected character in said 
character strings. 

14. (previously presented) The method of claim 1, wherein said encoding, selecting, 
or concatenating is performed on an internet site. 

15. (previously presented) The method of claim 1 , wherein said encoding, selecting; or 
concatenating is performed on a server. 

16. (previously presented) The method of claim 1, wherein said encoding, selecting, or 
concatenating is performed on a client linked to a network. 
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17. (currently amended) A computer program product on a computer readable media 
comprising computer code that: 

i) encodes two or more biological molecules into initial character strings to provide a 
collection of two or more different initial character strings wherein each of said biological 
molecules comprises at least about ten subunits; 

ii) selects at least two mitial substrings from said character strings; 

iii) concatenates said substrings to form one or more product strings about the same 
length as one or more of the initial character string; 

iv) adds the product strings to a data structure to populate a data structure of product 
strings; dsni 

v) determines sequence identities of the product strings relative to at least one initial 
character string; and 

vi) selects one or more product biological molecules for production, wherein the one or 
more product biological molecules correspond to one or more of the product strings having 
greater than 30% sequence identity with the at least one initial character string. 

18. (previously presented) The computer program product of claim 17, wherein said 
two or more biological molecules are nucleic acid sequences encoding naturally occurring 
proteins. 

19. (previously presented) The computer program product of claim 17, wherein said 
two or more biological molecules are nucleic acid sequences encoding naturally occurring 
proteins. 

20. (previously presented) The computer program product of claim 1 7, wherein said 
two or more biological molecules are amino acid sequences. 

21 . (currently amended) The computer program product of claim 17, wherein said 
initial character strings biologiool mol e oul e o have at least 30% sequence identity. 

22. (currently amended) The computer program product of claim 17, wherein said 
computer code selects s ub s tring in fii) at least one substring from an initial character string 
such that the ends of said aubatringo substring occur in string regions of about three to about 
twenty characters in the initial character string that have higher sequence identity with a 



corresponding region of another of said initial character strings than the overall sequence identity 
between the two initial character substrings. 

23. (previously presented) The computer program product of claim 17, wherein said 
computer code selects substrings such that the ends of said substrings occur in predefined motifs 
of about 4 to about 8 characters. 

24. (canceled) 

25. (previously presented) The computer program product of claim 1 7, wherein the 
computer code selects substrings by aligning two or more of said initial character strings to 
maximize pairwise identity between two or more substrings of the character strings, and 
selecting a character that is a member of an aligned pair for the end of one substring. 

26. (canceled) 

27. (previously presented) The computer program product of claim 1 7, wherein said 
computer code additionally randomly alters one or more characters of said character strings. 

28. (previously presented) The computer program product of claim 27, wherein said 
computer code additionally randomly selects and alters one or more occurrences of a particular 
preselected character in said character strings. 

29. (previously presented) The computer program product of claim 17, wherein said 
computer code is stored on media selected from the group consisting of magnetic media, optical 
media, and optomagnetic media. 

30. (previously presented) The computer program product of claim 17, wherein said 
computer code is in dynamic or static memory of a computer. 

31-44. (canceled) 

45. (previously presented) The method of claim 1, wherein the initial character strings 
of(i) are related. 

46. (previously presented) The method of claim 1 , fiirther comprising physically 
screening the molecule(s) represented by the product strings for one or more desired properties. 
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47. (previously presented) The method of claim I , further comprising determining a 
computationally predicted property for molecules represented by the product strings. 

48. (previously presented) The method of claim I , wherein the molecules represented 
by the product strings are made in parallel in an array of vessels. 

49. (previously presented) The method of claim 1 , wherein the molecules represented 
by the product strings are made by assembly of oligonucleotides. 

50. (currently amended) The method of claim 1, further comprising testing members 
of the data structure of product strings for a particular property and determinmg sequence 
differences responsible for differences in the particular property on optimal oombination of 
soquenoes using multi-variate analysis. 

5 1 . (previously presented) The computer program product of claim 1 7, wherein the 
initial character strings of (i) are related. 

52. previously presented) The computer program product of claim 17, wherein the 
code instmcts physical screening of the molecule(s) represented by the product strings for one or 
more desired properties. 

53. (previously presented) The computer program product of claim 17, wherein the 
code instructs determination of a computationally predicted property for molecules represented 
by the product strings. 

54. (previously presented) The computer program product of claim 1 7, wherein the 
molecules represented by the product strings are made in parallel in an array of vessels. 

55 . (previously presented) The computer program product of claim 1 7, wherein the 
molecules represented by the product strings are made by assembly of oligonucleotides. 

56. (currently amended) The computer program product of claim 1 7, wherein the 
code tests members of the data structure of product strings for a particular property and 
detemiines sequence differences responsible for differences in the particular property on optimal 
oombination of initial s e qu e no e a using multi-variate analysis. 

57. (previously presented) A method of identifying molecules for production, wherein 
the molecules are represented by concatenated strings, said method comprising: 



i) encoding two or more related biological molecules into a data structure of initial 
character strings to provide a collection of two or more different initial character strings wherein 
each of said biological molecules comprises at least about 10 subunits; 

ii) selecting at least two substrings bom said initial character strings; 

iii) concatwiating said substrings to form one or more product strings; 

iv) adding the product strings to a data structure to populate a data structure of product 
strings; and 

v) determining whether the product strings have at least a predefined measure of 
similarity with at least one initial character string; and 

vi) selecting one or more product biological molecules for production, wherein the one or 
more product biological molecules correspond to one or more of the product strings determined 
to have greater than the predefined value of sequence identity with at least one initial string. 

j>l ca3C iidd the following new otoimQ 5 8 131: ^ 

58. (new) The method of claim 1, wherein the one or more product strings of (vi) 
have greater than 50% sequence identity with the at least one initial character string. 

59. (new) The method of claim 1 , wherein the one or more product strings of (vi) 
have greater than 75% sequence identity with the at least one initial character string. 

60. (new) The method of claim 1 , wherein the one or more product strings of (vi) 
have greater than 85% sequence identity with the at least one initial character string. 

6 1 . (new) The method of claim 1 , wherein the one or more product strings of (vi) 
have greater than 90% sequence identity with the at least one initial character string. 

62. (new) The method of claim 1 , wherein the one or more product strings of (vi) 
have greater than 95% sequence identity with the at least one initial character string. 

63. (new) The computer program product of claim 1 7, wherein the one or more 
product strings of (vi) having greater than 50% sequence identity with the at least one initial 
character string. 
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64. (new) The computer program product of claim 17, wherein the one or more 
product strings of (vi) having greater than 75% sequence identity with the at least one initial 
character string. 

65. (new) The computer program product of claim 17, wherein the one or more 
product strings of (vi) having greater than 95% sequence identity with the at least one initial 
character string. 

66. (new) A method of identifying molecules for production, wherein the molecules 
are represented by concatenated strings, said method comprising: 

i) encoding two or more biological molecules into a data structure of initial character 
strings to provide a collection of two or more different initial character strings wherein each of 
said biological molecules comprises at least about 10 subunits; 

ii) selecting at least two substrings fiom said initial character strings; 

iii) concatenating said substrings to form one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adding the product strings to a data structure to populate a data structure of product 

strings; 

v) providing an alignment of the product strings; and 

vi) selecting one or more product biological molecules for production, wherein the one 
or more product biological molecules correspond to one or more of the product strings having 
greater than 30% sequence identity with the at least one initial character string. 

67. (new) The method of claim 66, wherein said encoding comprises encoding two 
or more amino acid sequences into said character strings, and wherein said two or more amino 
acid sequences comprise an amino acid sequence encoding a naturally occurring protein. 

68. (new) The method of claim 66, wherein said initial character strings have at 
least 30% sequence identity with each other. 

69. (new) The method of claim 66, wherein said selecting in (ii) comprises selecting 
at least one substring fiom an initial character string such that the ends of said substring occur in 
string regions of about 3 to about 20 characters in the initial character string that have higher 
sequence identity with the corresponding region of another of said initial character strings than 
the overall sequence identity between the two initial character strings. 
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70. (new) The method of claim 66, wherein said selecting in (ii) comprises selecting 
substrings such that the ends of said substrings occur in predefined motifs of about 4 to about 8 
characters. 

71 . (new) The method of claim 66, wherein said selecting in (ii) comprises aUgning 
two or more of said initial character strings to maximize pairwise identity between two or more 
substrings of the initial character strings, and selecting a character that is a member of an aligned 
pair for the end of one of the two or more substrings. 

72. (new) The method of claim 66. wherein said method further comprises randomly 
altering one or more characters of said initial or product character strings. 

73. (new) The method of claim 66, wherein the one or more-product strings of (vi) 
^ having greater than 50% sequence identity with the at least one initial character string. 

74. (new) The method of claim 66, wherein the one or more product strings of (vi) 
having greater than 75% sequence identity with the at least one initial character string. 

75. (new) The method of clahn 66, wherein the one or more product strings of (vi) 
having greater than 85% sequence idaitity with the at least one initial characta: string. 

76. (new) The method of claim 66, wherein the one or more product strings of (vi) 
having greater than 90% sequence identity with the at least one initial character string. 

77. (new) The method of claim 66, wherem the one or more product strings of (vi) 
having greater than 95% sequence identity with the at least one mitial character string. 

78. (new) A computer program product on a computer readable media comprising 
computer code that: 

i) encodes two or more biological molecules into initial character strings to provide a 
collection of two or more different initial character strings wherein each of said biological 
molecules comprises at least about ten subunits; 

ii) selects at least two initial substrings ftom said character strings; 

iii) concatenates said substrings to form one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adds the product strings to a data structure to populate a data structure of product 

strings; 
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v) provides an alignment of the product strings; and 

vi) selects one or more product biological molecules for production, wherein the one or 
more product biological molecules correspond to one or more of the product strings having 
greater than 30% sequence identity with the at least one initial character string. 

79. (new) The computer program product of claim 78, wherem said computer code 
encodes two or more amino acid sequences into said character strings, and wherein said two or 
more amino acid sequences comprise an amino acid sequence encoding a naturally occurring 
protein. 

80. (new) The computer program product of claim 78, wherein said initial character 
strings have at least 30% sequence identity with each othen 

8 1 . (new) The computer program product of claim 78, wherein said computer code 
selects in (ii) at least one substring from an initial character string such that the ends of said 
substring occur in string regions of about three to about twenty characters in the initial character 
string that have higher sequence identity with a corresponding region of another of said initial 
characto- strings than the overall sequence identity between the two initial character substrings. 

82. (new) The computer program product of claim 78, wherein said computer code 
selects in (ii) by selecting substrings such that the ends of said substrings occur in predefined 
motifs of about 4 to about 8 characters. 

83. (new) The computer program product of claim 78, wherein said computer code 
selects in (ii) by aligning two or more of said initial character strings to maximize pairwise 
identity between two or more substrings of the initial character strings, and selecting a character 
that is a member of an aligned pair for the end of one of the two or more substrings. 

84. (new) The computer program product of claim 78, wherein said compute code 
further randomly alters one or more characters of said initial or product character strings. 

85 . (new) The computer program product of claim 78, wherein the one or more 
product strings of (vi) having greater than 50% sequence identity with the at least one initial 
character string. 

86. (new) The computer program product of claim 78, wherein the one or more 
product strings of (vi) having greater than 75% sequence identity with the at least one initial 
character string. 
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87. (new) The computer program product of claim 78, wherein the one or more 
product strings of (vi) having greater than 85% sequence identity with the at least one initial 
character string. 

88. (new) The computer program product of claim 78, wherein the one or more 
product strings of (vi) having greater than 90% sequence identity with the at least one initial 
character string. 

89. (new) The computer program product of claim 78, wherein the one or more 
product strings of (vi) having greater than 95% sequence identity with the at least one initial 
character string. 

90. (new) A method of identifying molecules for production, wherein the molecules 
are represented by concatenated strings, said method comprising: 

i) encoding two or more naturally occurring biological molecules into a data structure of 
initial character strings to provide a collection of two or more different initial character strings 

^ wherein each of said biological molecules comprises at least about 1 0 subunits; 

ii) selecting at least two substrings fiom said initial character strings; 

iii) concatenating said substrings to form one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adding the product strings to a data structure to populate a data structure of product 
strings; and 

v) selecting one or more product biological molecules for production, wherein the one or 
more product biological molecules correspond to one or more of the product strings having 
greater than 30% sequence identity with the at least one initial character string. 

91. (new) The method of claim 90, wherein said encoding comprises encoding two 
or more nucleic acid sequences into said character strings. 

92. (new) The method of claim 90, wherein said encoding comprises encoding two 
or more amino acid sequences into said character strings, and wherein said two or more amino 
acid sequences comprise an amino acid sequence encoding a naturally occurring protein. 

93. (new) The method of claim 90, wherein said initial character strings have at 
least 30% sequence identity with each other. 
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94. (new) The method of claim 90, wherein said selecting in (ii) comprises selecting 
at least one substring from an initial character string such that the ends of said substring occur in 
string regions of about 3 to about 20 characters in the initial character string that have higher 
sequence identity with the corresponding region of another of said initial character strings than 
the overall sequence identity between the two initial character strings. 

95. (new) The method of claim 90, wherein said selecting in (ii) comprises selecting 
substrings such that the ends of said substrings occur in predefined motifs of about 4 to about 8 
characters. 

96. (new) The method of claim 90, wherein said selecting in (ii) comprises aligning 
two or more of said initial character strings to maximize pairwise identity between two or more 
substrings of the initial character strings, and selecting a character that is a member of an aligned 
pair for the end of one of the two or more substrings. 

97. (new) The method of claim 90, wherein said method further comprises randomly 
altering one or more characters of said initial or product character strings. 

98. (new) The method of claim 90, wherein the one or more product strings of (v) 
having greater than 50% sequence identity with the at least one initial character string. 

99. (new) The method of claim 90, wherein the one or more product strings of (v) 
having greater than 75% sequence identity with the at least one initial character string. 

100. (new) The method of claim 90, wherein the one or more product strings of (v) 
having greater than 85% sequence identity with the at least one initial charact^ string. 

101. (new) The method of claim 90, wherein the one or more product strings of (v) 
having greater than 90% sequence identity with the at least one initial characto: string. 

1 02. (new) The method of claim 90, wherein the one or more product strings of (v) 
having greater than 95% sequence identity with the at least one initial character string. 

1 03. (new) A computer program product on a computer readable media comprising 
computer code that: 

i) encodes two or more naturally occurring biological molecules into initial character 
strings to provide a collection of two or more different initial character strings wherein each of 
said biological molecules comprises at least about ten subunits; 
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ii) selects at least two initial substrings from said character strings; 

iii) concatenates said substrings to form one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adds the product strings to a data structure to populate a data structure of product 
strings; and 

v) selects one or more product biological molecules for production, wherein the one or 
more product biological molecules correspond to one or more of the product strings having 
greater than 30% sequence identity with the at least one initial character string. 

104. (new) The computer program product of claim 1 03, wherein said computer code 
encodes by encoding two or more nucleic acid sequences into said character strings. 

105. (new) The computer program product of claim 103, wherein said computer code 
encodes two or more amino acid sequences into said character strings, and wherein said two or 
more amino acid sequences comprise an amino acid sequence encoding a naturally occurring 
protein. 

106. (new) The computer program product of claim 1 03, wherein said initial 
character strings have at least 30% sequence identity with each other. 

1 07. (new) The computer program product of claim 1 03, wherein said computer code 
selects in (ii) at least one substring bom an initial character string such that the ends of said 
substring occur in string regions of about three to about twenty characters in the initial character 
string that have higher sequence identity with a corresponding region of another of said initial 
character strings than the overall sequence identity between the two initial character substrings. 

108. (new) The computer program product of claim 103, wherein said computer code 
selects in (ii) by selecting substrings such that the ends of said substrings occur in predefined 
motifs of about 4 to about 8 characters. 

109. (new) The computer program product of claim 103, wherein said computer code 
selects in (ii) by aligning two or more of said initial character strings to maximize pairwise 
identity between two or more substrings of the initial character strings, and selecting a character 
that is a member of an aligned pair for the end of one of the two or more substrings. 

1 10. (new) The computer program product of claim 103, wherein said computer code 
further randomly alters one or more characters of said initial or product character strings. 
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111. (new) The computer program product of claim 103, wherein the one or more 
product strings of (v) having greater than 50% sequence identity with the at least one initial 
character string. 

1 12. (new) The computer program product of claim 103, wherein the one or more 
product strings of (v) having greater than 75% sequence identity with the at least one initial 
character string. 

1 13. (new) The computer program product of claim 103, wherein the one or more 
product strings of (v) having greater than 85% sequence identity with the at least one initial 
character string. 

1 14. (new) The computer program product of claim 103, wherein the one or more 
product strings of (v) having greater than 90% sequence identity with the at least one initial 
character string. 

1 15. (new) The computer program product of claim 103, wherein the one or more 
product strings of (v) having greater than 95% sequence identity with the at least one initial 
character string. 

1 16. (new) A method of identifying molecules for production, wherein the molecules 
are represented by concatenated strings, said method comprising: 

i) encoding two or more biological molecules into a data structure of initial character 
strings to provide a collection of two or more different initial character strings wherein each of 
said biological molecules comprises at least about 10 subunits; 

ii) selecting at least two substrings fiom said initial character strings; 

iii) concatenating said substrings to form one or more product strings about the same 
length as one or more of the initial character strings; 

iv) adding the product strings to a data structure to populate a data structure of product 

strings; 

v) obtaining one or more computationally predicted properties for the product strings in 
the data structure; and 

vi) selecting one or more product biological molecules for production on the basis of the 
one or more computationally predicted properties. 
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1 1 7. (new) The method of claim 1 1 6, wherein the computationally predicted 
properties comprise one or more of a maximum or minimum molecular weight, a maximum or 
minimum free energy, a maximum or minimum contact surface with a target molecule or 
surface, a specified net charge, a predicted pK, a predicted pi, a binding avidity, secondary form, 
and tertiary form. 

1 1 8. (new) The method of claim 1 1 6, wherein said encoding comprises encoding two 
or more amino acid sequences into said character strings. 

1 19. (new) The method of claim 116, wherein said selecting in (ii) comprises aligning 
two or more of said initial character strings to maximize pairwise identity between two or more 
substrings of the initial character strings, and selecting a character that is a member of an aligned 
pair for the end of one of the two or more substrings. 

120. (new) The method of claim 1 16, wherein said method fiuther comprises 
randomly altering one or more characters of said initial or product character strings. 

121. (new) The method of claim 1 1 6, wherein the one or more product biolojgical 
molecules of (vi) having greater than 50% sequence identity with the at least one initial character 
string. 

122. (new) The method of claim 1 1 6, wherein the one or more product biological 
molecules of (vi) having greater than 75% sequence identity with the at least one initial character 
strinjg. 

1 23 . (new) The method of claim 116, wherein the one or more product biological 
molecules of (vi) having greater than 90% sequence identity with the at least one initial character 
string. 

1 24. (new) A computer program product on a computer readable media comprising 
computer code that: 

i) encodes two or more biological molecules into initial character strings to provide a 
collection of two or more different initial character strings wherein each of said biological 
molecules comprises at least about ten subunits; 

ii) selects at least two initial substrings from said character strings; 

iii) concatenates said substrings to fonn one or more product strings about the same 
length as one or more of the initial character strings; 
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iv) adds the product strings to a data structure to populate a data structure of product 

strings; 

v) obtains one or more computationally predicted properties for the product strings in the 
data structure; and 

vi) selects one or more product biological molecules for production on the basis of the 
one or more computationally predicted properties. 

125. (new) The computer program product of claim 124, wherein the computationally 
predicted properties comprise one or more of a maximum or minimum molecular weight, a 
maximum or minimum free energy, a maximum or minimum contact surface with a target 
molecule or surface, a specified net charge, a predicted pK, a predicted pi, a binding avidity, 
secondary fomi, and tertiary form. 

126. (new) The computer program product of claim 124, wherein the con^)uter code 
encodes in (i) by encoding two or more amino acid sequences into said character strings. 

127. (new) The computer program product of claim 124, wherein the computer code 
selects in (ii) by aligning two or more of said initial character strings to maximize pairwise 
identity between two or more substrings of the initial character strings, and selecting a character 
that is a member of an aligned pair for the end of one of the two or more substrings. 

128. (new) The computer program product of claim 124, wherein the computer code 
further randomly alters one or more characters of said initial or product character strings, 

129. (new) The computer program product of claim 124, wherein the one or more 
product biological molecules of (vi) having greater than 50% sequence identity with the at least 
one initial character string. 

130. (new) The computer program product of claim 124, wherein the one or more 
product biological molecules of (vi) having greater than 75% sequence identity with the at least 
one initial character string. 

131. (new) The computer program product of claim 124, wherein the one or more 
product biological molecules of (vi) having greater than 90% sequence identity with the at least 
one initial character string. 
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